Acoustic parameters for automatic detection of nasal manner
نویسندگان
چکیده
Of all the sounds in any language, nasals are the only class of sounds with dominant speech output from the nasal cavity as opposed to the oral cavity. This gives nasals some special properties including presence of zeros in the spectrum, concentration of energy at lower frequencies, higher formant density, higher losses, and stability. In this paper we propose acoustic correlates for the linguistic feature nasal. In particular, we focus on the development of Acoustic Parameters (APs) which can be extracted automatically and reliably in a speaker independent way. These APs were tested in a classification experiment between nasals and semivowels, the two classes of sounds which together form the class of sonorant consonants. Using the proposed APs with a support vector machine based classifier we were able to obtain classification accuracies of 89.53%, 95.80% and 87.82% for prevocalic, postvocalic and intervocalic sonorant consonants respectively on the TIMIT database. As an additional proof to the strength of these parameters, we compared the performance of a Hidden Markov Model (HMM) based system that included the APs for nasals as part of the front-end, with an HMM system that did not. In this digit recognition experiment, we were able to obtain a 60% reduction in error rate on the TI46 database. 2004 Elsevier B.V. All rights reserved.
منابع مشابه
Nasal detection module for a knowledge-based speech recognition system
The Lexical Access From Features (LAFF) project tries to model the representation and perception of speech by human listeners. The derivation of such a representation involves first finding certain acoustic landmarks. Based on the landmarks and the acoustic cues surrounding the landmarks, distinctive features of the speech segments may be deciphered. The present study concentrates on the nasali...
متن کاملAssessment of septoplasty effectiveness using acoustic rhinometry and rhinomanometry
Introduction: Septal deviation is the chief cause of chronic nasal obstruction. In order to treat such cases, nasal septoplasty surgery is usually performed based on patient complaints and a surgeon's examination, both of which are subjective. This study aims at using the objective parameters of acoustic rhinometry and rhinomanometry to evaluate the effectiveness of septoplasty surgery. Ma...
متن کاملAutomatic detection of manner events based on temporal parameters
In this study, we investigated how well acoustic events extracted from a cross-spectral temporal measure could be used to classify the manner and voicing of consonants. In particular, we developed seven measures that look at the strength and time difference between various onsets and offsets of acoustic energy. Consistent with findings by Shannon et al. (1995), our classification results show t...
متن کاملThe Effect of Nasal Obstruction after Different Nasal Surgeries Using Acoustic Rhinometry and Nasal Obstruction Symptom Evaluation Scale
BACKGROUND The efficiency of nasal surgeries can be determined by objective or subjective methods. We have assessed the effect of nasal obstruction after different nasal surgeries using Acoustic Rhinometry (AR) and Nasal Obstruction Symptom Evaluation (NOSE) Scale. METHODS Between May 2011 and May 2012, 40 young adult patients and 10 healthy volunteers as control group who referred to ...
متن کاملAutomatic detection of landmark for nasal consonants from speech waveform
A knowledge-based approach towards automatically detecting nasal landmarks (/m/, /n/, and /ng/) from speech waveform is developed. The acoustic characteristics Fn1 locus calculated on each frame of speech waveform as the mass center of spectrum amplitude in the vicinity of the lowest spectral prominence between 150-1000Hz, and A23 locus calculated on the same speech frame as a band energy betwe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Speech Communication
دوره 43 شماره
صفحات -
تاریخ انتشار 2004